An automatic approach for ontology-based feature extraction from heterogeneous textualresources

نویسندگان

Carlos Vicient

David Sánchez

Antonio Moreno

چکیده

Data mining algorithms such as data classification or clustering methods exploit features of entities to characterise, group or classify them according to their resemblance. In the past, many feature extraction methods focused on the analysis of numerical or categorical properties. In recent years, motivated by the success of the Information Society and the WWW, which has made available enormous amounts of textual electronic resources, researchers have proposed semantic data classification and clustering methods that exploit textual data at a conceptual level. To do so, these methods rely on pre-annotated inputs in which text has been mapped to their formal semantics according to one or several knowledge structures (e.g. ontologies, taxonomies). Hence, they are hampered by the bottleneck introduced by the manual semantic mapping process. To tackle this problem, this paper presents a domain-independent, automatic and unsupervised method to detect relevant features from heterogeneous textual resources, associating them to concepts modelled in a background ontology. The method has been applied to raw text resources and also to semistructured ones (Wikipedia articles). It has been tested in the Tourism domain, showing promising results. & 2012 Elsevier Ltd. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Face Recognition via Local Directional Patterns

Automatic facial recognition has many potential applications in different areas of humancomputer interaction. However, they are not yet fully realized due to the lack of an effectivefacial feature descriptor. In this paper, we present a new appearance based feature descriptor,the local directional pattern (LDP), to represent facial geometry and analyze its performance inrecognition. An LDP feat...

متن کامل

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

متن کامل

A Semi-automatic Ontology Learning Method for E- Learning Recourses Terminology Extraction

In this paper, we propose semi-automatic ontology learning from heterogeneous recourses method, using non-representative domain texts. The proposed method is based on natural text syntactic, morphologic and semantic analysis and it uses heterogeneous recourses for extracting knowledge. It may be used for extracting ELearning textual recourse terminology and their representation as an ontology.

متن کامل

Automatic Workflow Generation and Modification by Enterprise Ontologies and Documents

This article presents a novel method and development paradigm that proposes a general template for an enterprise information structure and allows for the automatic generation and modification of enterprise workflows. This dynamically integrated workflow development approach utilises a conceptual ontology of domain processes and tasks, enterprise charts, and enterprise entities. It also suggests...

متن کامل

Automatic Workflow Generation and Modification by Enterprise Ontologies and Documents

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Eng. Appl. of AI

دوره 26 شماره

صفحات -

تاریخ انتشار 2013

An automatic approach for ontology-based feature extraction from heterogeneous textualresources

نویسندگان

چکیده

منابع مشابه

Automatic Face Recognition via Local Directional Patterns

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

A Semi-automatic Ontology Learning Method for E- Learning Recourses Terminology Extraction

Automatic Workflow Generation and Modification by Enterprise Ontologies and Documents

Automatic Workflow Generation and Modification by Enterprise Ontologies and Documents

عنوان ژورنال:

اشتراک گذاری